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Estimation of clinical cut-offs 



The present invention concerns methods and systems for improving the accuracy of 
5 predicting resistance of a disease to a drug. More specifically, the invention provides 
methods for assessing the impact of pre-existing variations in drug susceptibility, 
whether naturally occurring or selected by previous drug exposure, on treatment 
response in order to establish clinically relevant cut-off values for phenolypic or 
genotypic resistance tests, 

10 

All publications, patents and patent applications cited herein are incorporated in fldl by 
reference. 

Techniques to detennine the resistance of a disease to a drug are becoming increasingly 
15 important. Since the issuance of the first report suggesting a correlation between the 
emergence of viral resistance and clinical progression, techniques to determine the 
resistance of a pathog^ to a drug have been increasingly incorporated into clinical 
studies of therapeutic regimens (see Brendan Larder et al., HIV Resistance and 
Implications for Therapy (1998), herein incorporated by reference). For example, as 
20 with viral infections, some studies show that p53 mutations may also be predictive of 
tumour response to specific anticancer drug therapy, radiation treatment or gene 
therapy. This is the case in breast cancer where initial studies have shown that cisplatin 
and tamoxifen are more effective in patients whose tumoms have a p53 mutation. Thus, 
the aim of resistance monitoring is to provide the necessary information to enable the 
25 physician to prescribe the most optimal combination of drugs for the individual patient. 

With more therapeutic options becoming available over time, resistance testing is 
expected to play an important role in the management and treatment of disease and the 
development of individualized treatment regimes [see e.g. Haulbrich et al JADDS, 
30 2001,26S1, S51-S59]. 

Furthermore, the number of drug resistant diseases is also increasing. Phenolyping 
methodologies measure the ability of a pathogen to grow in the presence of different 
drugs in the laboratory. This is usually expressed as the fold change in the IC50 or IC90 
35 values (the IC50 or IC90 value being the drug concentration at which 50% or 90% 

respectively of the population of pathogen is inhibited from replicating). For example, a 
highly resistant virus might show a 50 or even 100-fold increase in ICso, for example. 
Some viral mutations only increase the IC50 by as little as 2-3 fold. On the other hand, a 
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pafhogen may exhibit hypersensitivity towards a given drug. For example, it has been 
demonstrated that a combination of HIV mutations may lead to hypersensitivity of the 
pathogen towards a given dmg. 

5 Unlike genotyping, phenotyping is a direct measure of susceptibility, reflecting the 
effects and interactions of all the mutations, known or unknown, on the behaviour of 
the pathogen population in the presence of a drug. 

The utility of drug susceptibility phenotyping is dependent on the "cut-off" value of the 
10 fold increase in, for example, the IC50 at which a pathogen is considered resistant. The 
term "cut-off value", as xised herein, refers to the threshold change in susceptibility 
above which a pathogen is classed as having reduced susceptibility for a particular 
dmg. Dmg "resistance", as used herein, pertains to the capacity of resistance, 
sensitivity, susceptibility or effectiveness of the dmg against the pathogen, 

15 

There has been recent debate regarding the relevance of some cut-off values currently 
in use. For example, for viral infections, certain groups currently use technical cut-off 
values, which are usually the same value for each drag-tested and are determined not 
by clioical criteria but, for example, by the assay variability seen on repetitive testing of 
20 a single wild type standard viras. By repeatedly running a test with the standard 

reference vims, the reproducibility of the test is measured and a cut-off is set at this 
level, (e.g., a 2,5-fold increase in IC50)- This provides a cut-off that depends largely on 
the analytical performance of the assay. This approach suffers fi:om the limitation that it 
does not consider the population-based variation in drag responsiveness. In addition, 
25 such an approach does not account for different responsiveness towards different drag 
regimens. The limitations of setting a single cut-off for all available drags in this way is 
that it tells the clinician very little about the significance of any change in susceptibility 
reported by a test Indeed, some virological cut-off values are clearly out of line with 
known response data. For example, indications of low level resistance to non- 
30 nucleotide reverse transcriptase inhibitors (NNRTIs) does not lead to blunted responses 
to drags in previously untreated individuals (Harrigan et aL, Bacheler et ah^ 4th 
Ditemational Workshop on HIV Drag Resistance and Treatment Strategies, Sitges, 
Spain. Abstr. (2000)), Other assays have cut-off values that are primarily based on the 
reproducibility of the assay, are the same for each dmg, or are not related to whether a 
35 drag might work against the pathogen in clinical practice and are, therefore, rather 
arbitrary. 

Methods have already been described to develop more meaningful, biologically 
relevant cut-off values for drags used in HIV therapy. For example, Virco measured the 



wo 2005/086061 



PCT/EP2005/050888 



ICso values for isolates fiom 1,000 untreated patients as well as many thousands of 
samples of HIV-l with no resistance mutations. The average and the range of 
susceptibility were calculated for each drug. The cut-offs were then set at two standard 
deviations above the mean. This statistical term means that a lest result falling above 
5 the cut-off can be said to be above the normal susceptible range with 97.5% confidence 
(Harrigan et al World-wide variation in HIV-l phenotypic susceptibility in untreated 
individuals: biologically relevant values for resistance testing. 2001. ADDS 15:1671- 
1677). Since the susceptibility of untreated and un-mutated virus varied considerably 
flx)m drug to drug, the predicted biological cut-offe are different for each drug, 

10 

The use of biological cut-oflfe has changed the amount of resistance being reported for 
HIV. For example, the biological cut-off values for the dideoxynucleoside analogues 
are lower than the cut-offs used previously and, in a study of 5,000 random clinical 
samples, revealed a higher and more realistic incidence of resistance. Conversely, the 
15 cut-offs for the non-nucleoside reverse transcriptase inhibitors are higher than those 
previously used. 

However, although the biological cut-off values are a vast improvement to the arbitrary 
cut-offs used previously, there are still disparities between these predicted thresholds 
20 and the observed fold-resistance above which a clinical response is actually reduced. 

There is thus a great need for a method that can establish cut-off fold change resistance 
values that are clinically-relevant. 

The present invention provides a solution to these problems, in the form of new 
25 methods for assessing the impact of pre-existing variations in drug susceptibility, 
whether naturally occurring or selected by previous drug exposure, on treatment 
response ia order to establish clinically relevant cut-off values for phenotypic or 
genotypic resistance tests, 

30 Summary of the invention 

According to the invention, there is provided a diagnostic method for estimating for a 
patient the treatment response of a disease caused by a pathogen to a drug, the method 
con^rising: 

comparing the fold change resistance value of the pathogen infecting the patient to a 
35 clinical cut-off value which is the fold change resistance value at which a clinically 
relevant variation of clinical response is observed; 
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wherein the clinical cut-off value is established by modelling the clinical lesponse of a 
population of patients treated with the drag to the disease caused by the pathogen as a 
function of the fold change resistance of the palhogen infecting the patients. 

5 According to the invention, a threshold jfold-resistance is established, above which a 
disease is classified as being resistant to a drug ia a clinical context. The method 
models treatment response of the pathogen causing the disease to a particular drag as a 
fimction of baseline pathogen load, baseline resistance, baseline activity of co- 
administered drags targeted to the pathogen and treatment history. By '"baseline 

10 pathogen load" is meant the pathogen load of the patient measured at the start of 

treatment by the drag. By *l)aseline fold change resistance" is meant the fold change 
resistance to the candidate drag exhibited by the pathogen infecting the patient at the 
start of treatment By ^^baseline activity of co-administered drags targeted to the 
pathogen" is meant the activity against the pathogen of each drag administered in 

15 combination with the drug for which the treatment response is being modeled. By 

'^treatment history" is meant the previous drag exposure of the patient (and therefore, 
the pathogen). 

In a preferred embodiment, the cut-off value is determined as a function of treatment 
20 response data in treated subjects, considering baseline pathogen load, baseline fold 

change resistance, baseline activity of co-administered drags targeted to the pathogen, 
and treatment history. 

This method thus provides a prediction of clinical outcome at different levels of 
25 baseline resistance. According to this methodology, treatment outcome (drop in 

pathogen load and response rate) is modeled by drag as a function of baseline fold 
change resistance as determined by reference to a system that measures drag resistance 
phenotype or predicts drag resistance phenotype from pathogen genotype (such as 
VirtualPhenotype®, Virco). The models take into account effects of co-administered 
30 drags, baseline pathogen load and, optionally, treatment history in order to avoid any 
bias introduced by imbalances of clinically-important characteristics. From the model, 
a prediction of outcome can be made at diluent levels of the baseline fold change 
resistance of the pathogen. 

35 Using this methodology, fold change resistance values obtained by coirparison of 
genotype with phenotype (for example, VirtualPhenotype®) are Hnked with clinical 
outcome. This is a unique approach; other research groups use different approaches 
whereby particular mutations or actual phenotype results are linked with clinical 
outcome. 
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The methodology of the invention is advantageous over those currently used. For 
example, conventional approaches do not fully account for the population-based 
variability in drug sensitivity. In the present method, the population may include 
5 treatment najfve and treatment experienced patients, and may be a mixed population 
which is not restricted to, for example, a single gender, age, race or sexual behaviour. 

The method of the invention also accounts for the different responsiveness in a 
population towards different drags. The drug-specific clinical cut-off values determined 
10 by this approach are more reliable parameters in estimating resistant over sensitive 
strains of pathogen. 

The method also allows clinical cut-o% established using the method to be re- 
calculated depending on the type of population studied, Le. a paediatric population may 
1 5 have a different clinical cut-off for a particular drag than the adult population for flie 
same drag. 

Of particular importance, this methodology allows the determination of clinical cut-offs 
for all marketed drags in a uniform, scientific manner on a substantial database using 
20 data derived fixjm resfponse to combination therapy. Currently available cut-of & are 

determined by reference to a limited amount of data and may be inconsistent as they are 
determined using different approaches. 

•at 

According to the invention, clinical cut-off values are established by modeling the 
25 clinical response of a population of patients treated with the drag to the disease caused 
by a particular pathogen as a fimction of the fold change resistance of the pathogen 
infecting the patients. The fold change resistance for a pathogen may be established 
using methods known in the art. Briefly, the sensitivity of a patient sample for a 
particular drag is compared with the sensitivity of a reference sample for that same 
30 drag- This may be done by a) determining the sensitivity of a patient sample for the 
drag; b) determining the sensitivity of a reference sample for the drag; and c) 
determining the patient fold change resistance firom the quotient of the sensitivity 
obtained in step a) over the sensitivity obtained in b). Examples of preferred methods 
for performing these steps are described in detail in co-pendiag appUcations 
35 WOOl/79540 and WO02/33402. Equivalent methods will be apparent to the person of 
skill in the art. 

In a preferred embodiment of the invention, the cut-off fold change resistance value is 
calculated by reference to the log of Ihe pathogen load drop. In such a method, a linear 
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regression analysis is preferably performed using a set of treatment response data from 
subjects harbouring the pathogen, wherein the log palfaogen load drop LogPL dropu for 
the pathogen infecting a patient is modelled as the sum of all of the individual 
contributions for &ctors that influence pathogm load drop, according to the following 
S equation: 

LogPLdrop^ = Po + ^rLog(BaseUnePL,)+ ^^{PSS;)-^ %i\IFC.)+z^ 

In this equation, BaselinePLi represents the pathogen load of the patient measured at 
the start of treatment by the drag. 

PSSi is a phenotypic sensitivity score representing the number of active drags in the 
10 background treatm^Lt regimen fer Ihe patient, excluding the drag whose contribution to 
treatment response is being modelled. 

FCi is a baseline fold change resistance. 

Po is the intercept 

Pi is a coefficient representing the increase in log pathogen load drop per unit increase 
15 of the log of the BaselinePLu In the case of HIV and HCV infectiori, baseline PLi is 
readily quantified by validated commercial assays. 

>?2 is a coefficient indicating the increase in log pathogen load drop per unit increase of 
the number of sensitive drags in the background treatment regimen. 

rt.v Ps is a coefficient indicating the increase in log pathogen load drop per unit increase of 
20 the inverse of FCu The value of this coefficient is part of the output of the described 
model. 

e,; is an error term which represents the difference between the modelled prediction and 
the experimentally determined measurement 

25 The coefficients in the linear regression model may be calculated using a computer 
* analysis package such as PROC LIFEEIEG. PROC LBFEREG is procedure within Ihe 
SAS (Statistical Analysis System) software which performs linear regression on 
censored data. By de&ult, the LIFERBG Procedure computes initial values for the 
parameters using ordinary least squares (OLS) ignoring censoring. The log-likelihood 

30 function is naaximized by means of a ridge-stabUized Newton-Raphson algorithm. 

PSSU the phenotypic sensitivity score, represents the number of active drags in the 
background treatment regimen for the patient, as predicted from pathogenic genotype 
by FJrtwaiPhenolype™ or other algorithms or as measured by actual phenotype testing. 
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The purpose of this term is to aUow a drug-specific value to be extracted from 
treatment response data that has been collected for a patient that has received a 
combination of drugs. In this way, resistance data relevant solely to the particular drug 
under investigation is extracted. The other drags are consid^ed the background 
5 regimen; this may be different for different patients. It is necessary to analyze patients 
with different background regimens together as there would not be enough data to do a 
sound analysis otherwise. 

During Ihis analysis it has to be taken into account that different background regimens 
10 influence the clinical outcome in a different way. In order to do this, the activities of 

background drags are summarised, by determining the number of active drugs, and thus 
devising a PSS (preferably judged as active according to VirtualPhenotype®). The PSS 
is then included in the model. 

15 In a preferred embodiment, the PSS may be calculated based on preliminary clinical 
cut-oJ% which are determined as described. The concept of PSS is discussed in detail 
by DeGrattola et at (Antiviral Therapy 2000; 5:41-48). In addition, the concept of 
continuous PSS as a variation of PSS is discussed by Bosch et at (AIDS 2003, 17:1-9); 
Katzenstem et al (AIDS 2003; 17:821-830); and Haiihrich et al CT)elavirdine 

20 Hypersusceptibility (DLV HS): Virological Response and Phenotypic Cut-Points — 
Results from ACTG 359"; 1 1th Conference on Retrovirases and Opportunistic 
Infections held on 8-1 1 February 2004 in San Francisco, CA, USA). The PSS may be 
determined by an iterative process such tiiat the cut-off value is refined to a constant 
value. In subsequent iterations of the model, PSS scores based on preliminary clinical 

25 cut-offs defined in the first iteration of the model may be utilized. 

FCi, the baseline fold change, is equivalent to baseline fold change resistance. These 
terms are used interchangeably herein. This is a patient-specific term and is determined 
based on a drug susceptibility phenotype test or predicted based on the genotype of the 

30 pathogen infecting a particular patient. The phenotype exhibited by the pathogen of this 
genotype may be predicted in a number of ways; generally, such techniques compare 
the genotype to phenotype data collected ftom a group of patients infected with a 
pathogen of similar genotype. However, this does not change the fact that this fold 
change resistance is a characteristic of the specific pathogenic strain infecting an 

35 individual patient at baseline. 



For example, prediction of baseline fold change resistance may e^qploit rules-based or 
other less direct systems of determining the drug resistance phenotype of a pathogen. 
An example of a less direct system is the Virtual Phenotype (Virco, Inc.; 
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PCT/EPOl/04445). Prediction of baseline fold change resistance may alternatively use 
other systems for determining phenotype from genotype information, such as neural 
networks that determine the drug resistance phenotype of a pathogen based on its 
genotypic information (see, for example, U. S. Patent Application No. 09/589,167; 
5 PCT/EPOl/06360, The neural network may be used to identify mutation(s) or mutation 
patterns that confer resistance to a drag and defines the genetic basis of drag resistance, 

P09 the intercept, is the estimated log pathogen load drop for a reference group i.e. a 
theoretical group of patients with a baseline pathogen load of one, an infinite fold 
10 change resistance and no sensitive drags in the background. The purpose of this term is 
to improve the model fit If it was not included, the fitted curve woidd be forced to pass 
through the origin (zero Log PL drop at zero fold change resistance), which could lead 
to an unrealistic model. 

1 5 The error term, represents the difference between the modelled prediction and the 

experimentally determined measurement ie. the difference between the actual response 
of the patient and the predicted response. As more data are added to the model, 
additional factors that are relevant to the determination of clinical cut-off values will be 
added. This will improve the model fit and therefore the error of the prediction will 

20 decrease. All the P terms are estimated simultaneously by minimizing the error term. 

In this methodology of this embodimmt of the invention, censoring (pathogen loads 
beyond the assay range caused by the detection limits of pathogen Idkd kits) affects the 
results and therefisre procedures that take censoring into account are preferably applied. 

25 Preferably, censored values are dealt with by attempting to constract a model that is 

consistent firom extrapolations. This model is applicable to any described methodology. 
Censored values are thus modelled by replacing the censored value by a maximum 
likelihood estimation, assuming knowledge of the standard deviation of the 
measurement error. For example, censored values may be dealt with using the PROC 

30 LEFEREG pre-programmed procedure in the statistical analysis package SAS that 
performs analyses with censored values. 

An advantage of the linear regression method described above is that quantitative data 
about changes in pathogen load can be studied because pathogen load is considered as a 
35 continuous variable. This therefore takes into account the maximum amount of 

information present in the data. Estimates are corrected for covariates in the model (for 
example, background regimen) and therefore, do not suffer flrom imbalances in the 
covariates. Conclusions are limited to patients with covariates that are represented in 
the dataset in the clinical response database. 
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Other baseline chaxacteristics may be added to the linear regression if relevant, 
resulting in the addition of new tetms in the equation given above. Examples of 
additional baseline characteristics include the total duration of the previous treatment, 
and the time at which treatments were administered For example, estimates can be 
S corrected for duration by adding a term P4(Duration) in the model equation given 
above. 

Furthermore, additional &ctors may be taken into account, including sensitivity score 
per drug class (in addition to the overall sensitivity score of the background treatment: 
10 cPSS), previous exposure to the drag (naive: Yes or No; naive to protease inhibitors: 
Yes or No; naive to nucleotide RT Inhibitors: Yes or No; and so on). Further examples 
will be clear to those of skiU in tiie art. 

A quadratic term for the cPSS may be added to the model. 

15 

The fold change resistance may be transformed before putting it into the model. For 
example, a power transformation ranging from FC"^ to FC^ may be performed on the 
fold change. 

20 Accordingly, a more general form of the equation presented above may be expressed 
as: 

LogPLdrop, = po + ^,Log(BaselinePL,)^ ^^icPSS,)^ ^icPSS.f + ^^{FCy + P^C^^)^ p„(fi^„) + 8, 

25 wherein p is a power transformation (e.g. ranging from -3 to 1) and H5 to Hnare 
treatment history parameters (e.g. naJve to antiretroviral therapy, naive to NRTl 
treatment, etc. ...) or parameters describing the background therapy as a &nction of a 
certain therapeutic class (e.g. the number of active NRTTs taken concomitanfly with the 
drag under investigation). 

30 

An example of characteristics of analysis datasets (8 week outcome) for individual 
drags ate in the following form: 





Ranee (Drug) 


Median Baseline Viral Load (log) 


3.32 (TDF) - 4.71 (boosted IDV) 


Median background cPSS 


1.34 (ddC) - 2.58 (LPV/r) 


# regimens including the drag 


24 (unboosted APV) - 1551 (3TC) 


% from cohort data 


21% (unboosted APV) - 83% (ddl-EC) 


% with no resistance mutations 


14.5% (boosted APV) - 75% (EFV) 
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As stated above, the clinical cut-of & detertnine the fold change resistance with a 
diminished predicted clinical response to drug. In an alternative to merely classifying 
pathogens as sensitive or resistant, the melhod of this aspect of the invention preferably 
incorporates three classifications, namely "sensitive", associated with maximum 
5 response to drug therapy, "intermediate", associated with reduced, but still significant 
response to drug therapy, and "resistanf , associated with little if any response to drug 
therapy. For example, by one set of definitions relevant for HIV response, "sensitive" 
may be classified as a predicted pathogen load drop of more than about 0,6 logs, 
"intermediate resistance" may be classified as a predicted pathogen load drop of 
10 between about 0.2 and about 0.6 logs and '^sistanf ' may be classified as a predicted 
pathogen load drop of less than about 0.2 logs. la another set of definitions, "sensitive" 
may be classified as a predicted pathogen load drop of between about 0.5 logs and 1 .0 
logs. Cut-ofiOs calculated using these definitions are Mgbly dependant on covariates. 

15 In a further preferred embodiment of the invention, the cut-off fold change resistance 
value is calculated by reference to the probability of the pathogen being susceptible to 
treatment by the drag for the patient, herein termed Prob of success. In such a method, 
Prob of success is preferably calculated by perfiirming a logistic regression analysis 
using data fix)m a clinical response dataset, wherein Prob of success is modelled 

20 according to the following equation: 
Prob of 

exp(p, -I- ^,LogiBaselimPL,^ + ^^{PSS,)^^,{\/FC,)) 

ri,: 1 success • 

(l + expCPo + ^^LogiBaselinePL,)^ %{PSSj% PsCl/FC,))) 

The terms in the equation are the same as those described above for the embodiment of 
25 the invention described above. 

This method of logistic regression does not suffer from the censoring problem 
described above for the linear regression model Furthermore, the probability of success 
is an intuitive way of ioterpreting clinical outcome. One disadvantage is that by 
30 classifying the pathogen load into successes and failures, part of the information of the 
continuous variable pathogen load is lost. 

Estimates may also be corrected for covariates as for linear regression. 

35 Again, like the method of the first described embodiment of the invention, the method 
of second described embodiment also preferably incorpomtes fiie three classifications, 
sensitive, intermediate resistant and resistant On the basis that the maximum effect is 
defined as the treatment effect at a fold change resistance of approximately 1 fold 
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change or the fold change demonstrated by wild type vital isolates firom patients, and 
the mitiiTm im effect is defined as the treatment effect at a very high fold change 
resistance (i.e. when the curve reaches a plateau), the "effect range" is the difference 
between the maximum effect and the minimum effect. The maximum effect may be 
5 defined as the treatment effect at fold change resistance of between about 0.7 and about 
1.2 fold change resistance. 

Preferably, a "sensitive" genotype is classified as a predicted treatment effect of more 
than about 78-85% of the effect range. Preferably, "intermediate resistanf is classified 
as a predicted treatment effect of between about 15-25% and about 75-85% of the 
effect range. Preferably, **resistant" is classified as a predicted treatment effect of less 
than about 15-25% of the effect range. Cut-offs calculated using this method are less 
dependent on covariates than the method described earlier which uses predicted 
pathogen load drops. However, the effect range will vary for different covariates. 

With this approach, two cutoff per drug are identified: a "lower" cutoff which 
represents the fold change at which Ihe response begins to be lost, and an '\tppef" 
cutoff which represents the fold change at which the resfponse is essentially gone. 

The lower and upper cutoff may be defined as the fold change with expected log viral 
load drops of >0.6 and <0.2 resfpectively. 

The lower and upper cutoflS may be defined as the fold change associated with an 
expected 20% and 80% decrease respectively of the reference activity of the dmg 
vdthin the regimen. 

Accordingly, a first definition (definition 1) of lower and upper cutoffi; are the fold 
changes with expected log viral load drops of >0.6 and <0.2 respectively. 

A second definition (definition 2) of the lower and higher cutof6 are the fold changes 
associated with an expected 20% and 80% decrease respectively of the reference 
activity of the drug within the regimen. 

A third definition (definition 3) of Ibe cut-off is the fold change that most optimally 
distinguished between patients with successfid and unsuccessfiil treatments. 

In one embodiment of the invention, definition 1 is applied for Tenofovir on a 
population taking two active drugs besides tenofovir and with a baseline Log(Vl) of 4, 
gives an predicted drop in log viral load of -0.6 at fold change 3.73. 
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If definition 2 is applied on the linear regression model, the predicted drop in log viral 
load may be -1,48 at fold change 1, and -0.28 at the maximum fold change. Therefore 
the effect range may be -0.28 + 1.48 = 1.2. 20% of this effect range was observed at 
fold change 5 (and this value was considered as the upper clinical cut-off value). 80% 
5 of the effect range was observed at fold change 1 .25 (and this value was considered as 
the lower clinical cut-off value). 

For this embodimenii when the FC of patient is 0.8 (below the lower clinical cut-off), a 
normal clinical response is predicted. If the FC of the patient is 2 (above the lower 
1 0 clinical cut-off and below the upper clinical cut-oflE), a reduced clinical response is 
predicted If the FC is 7 (above the clinical cut-off), then the clinical response is 
predicted as being minhnal. 

In an embodiment when definition 2 is applied, and the logistic model is used for 
15 tenofovir, a lower cut-off at 1.2 FC and a higher cut-off at 3.81 FC are determined 



The following table depicts a number of embodiments of the invention for tenofovir for 
a population with 2 active drugs in the regimen and a baseline Log(VL) of 4. 



Definition of 






Unear 
Regression 


Logisffc 
i^egression 


CiassHication 
Tree 


Clinical 
cut-off 


Population 


ProperOes of the subgroup 


Lower 
CO 


Higher 
CO 


Lower 
CO 


Higher 
CO 


Lower 
CO 


Higher 
CO 


Deffnition i 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


3.73 


> assay 
limit 


NA 


NA 


NA 


NA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


1.68 


3.8 


NA 


NA 


NA 


NA 




Subgroup 3 


PSS^2, baseline Log(VL) - 5 


> assay 
limit 


> assay 
limit 


NA 


NA 


NA 


NA 




Overall 




NA 


NA 


NA 


NA 


NA 


NA 


DeffniHon 2 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


1.25 


5 


1.2 


3.81 


NA 


NA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


1.25 


5 


1.16 


3.36 


NA 


NA 




Subgroup 3 


PSS-2, baseline Log(VL) = 5 


1.25 


5 


1.17 


3.4 


NA 


NA 




Overall 




1.25 


6 


NA 


NA 


NA 


NA 


Definition 3 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


1.1 


5 


1.2 


3.81 


1.15 


IMA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


1.1 


5 


1.2 


3.36 


1.15 


NA 




Subgroup 3 


PSS-2, baseline Log(VL) = 5 


1.1 


5 


1.2 


3.4 


1.15 


NA 




Overall 




1.1 


5 


1.2 


NA 


1.15 


NA 


NJA: Not Applicable 





20 
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In a furfher embodiment, ^yplying definition 1 on the linear xegiession model for 
patients with a log baseline viral load of 5 and all patients taking two active drugs in 
addition to d4T(stavudine), a viral load drop of more than 0.6 log copies/mL for any 
fold change of d4T is predicted The viral load drop is predicted to be -0.6 logs and — 
5 0.2 logs at fold changes 2.6 and 4.0 for patients with a log baseline viral load of 5 and 
taking no active drags in addition to d4T- 

In another embodiment, lower and upper cutoff predicted using definition 2 for 
lopinavir/r are 8 and 69 respectively for the whole population if viral load is modeled 
10 using linear regression, and the lower and iipper cuttofe are 1 1 and 64, 10 and 60, and 9 
and 58 respectively for populations with log baseline VL/background PSS of 4/2, 5/0 
and 5/2 respectively if the &ilure rate is modeled using logistic regression. 

In anoth^ embodiment, lower and upper cutoffs determined using definition 2 for 
15 boosted saquinavir for the logistic model are 1.7 and 13,2, and 1.7 and 12.9 

respectively for populations with log baseline viral load/phenotypic sensitivity score for 
the background regimen of 4/2 and 5/0 respectively. In the same circumstances lower 
and upper cutoffs by linear regression for saquiavir/r are 1.6 and 12.3 respectively for 
the whole population. 

20 

The following table shows a number of embodiments modelled using a preliminary 
linear regression analysis and definion 2: 



DRUG 


VIROLOGIC RESPONSE 


Baseline FC for 20% 
REDUCTION of 
response(lower cutoff) 


Baseline FC for 80% 
REDUCTION of response 
(upper cutoff) 


AZT zidovudine 


1.8 [1.5-2.5] 


17 [10-25] 


3TC lamivudine 


1.1 [1.1-1.2] 


2.6 [1.9-4.6] 


D4T stavudine 


1.3 ri.2-1.41 


3.4 [3.1-3.6] 


Ddl didanosine 
(extended release) 


1.3 [1.2-1.9] 


3.6 [2.8-4.9] 


ABC abaca>dr 


1.6[1.1-2.ei 


5.8 [1.7-7.4] 


TDF Tenofbvir 


1.2 [1.1-1.51 


2.5 [1.7-3.8] 


IDV indinavir 


1.2 [1.1-1.9] 


3.4 [1.9-16.4] 


IVO/r Indinavlr/r 


3.5 [1.1-8.4] 


25 [1.8-31] 


NFV nelfinavir 


1.1 [1.1-1.3] 


2.2 [1.7-5.3] 


SQV saquinavir 


1.1 [1.1-2.1] 


2.0 [1.7-18] 
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DRUG 


VIROLOGIC RESPONSE 


Dciod lilts ¥Oi £v/o 

REDUCTION of 
responseOower cutoff) 


Evcioeiiiie r w Tor ou /o 

REDUCTION of response 
(upper cutoff) 


SQV/r Saquinavir/r 


1.6 [1.3-4.81 


12 [5.8-27] 


AMP amprenavir 


1.2 [1.1-2.41 


3.4 [1.7-10.21 


AMP/r amprenavir/r 


1.5 [1.2-2.61 


6.8 [3.6-10.5] 


LPV/r Lopinavir/r 


6.9 [2.1-17.4] 


56 [29-67] 



The values between brackets in fhe table are the 90% confidence limits as determined 
by bootstrapping. 



5 In another embodiment, lower clinical cutoffs modelled using definition 2 after 8 weeks 
were 1.1 to 1,2 for unboosted Pis, and upper clinical cutoffs were 2.0- 3.4 for 
unboosted Pis. Clinical cutoff for boosted Pis were higher: lower CCO - 1.5 -6.9 and 
upper CCO 6.8-56. 

10 In an example for £>4T the linear regression model conqpiises: 

VLdrop = Z91 ~ 0.63Log(BaselineVL) - 1.66(FC^^) - 0.99(cPSS) + 0J5(cPSSF) 
- 0J8(PSS[NRTri) + 0.91(NRTI[naive]). 

The coefficients in the linear regression model were c^culated using PROC LIFEREG, 

The NRTI[naive] value represents whether the patient is naive to nucleoside RT 
15 inhibitors. If yes, value =1, if no, value =0. The PSS[NRTr} represents the phenotypic 
sensitivity score for NRTIs, i.e Ihe number of active NRTIs in the background regimen 
&r the patient. 

When Log(BaselineVL) = 4, lower reference fold change =0.9 and upper reference fold 
change=3, cPSS=2, PSS[NRTr|=l and NRTI[naive]=0, the modelled lower and upper 
20 CCOs are modeled as 1 .1 and 2,2 respectively using definition 2. 

In another example, the linear regression model is applied to an 8 week viral load 
response and modelled as a function of baseline phenotypic resistance. The subjects 
may have a log baseline viral load = 4.0 and a cPSS score for the background regimen 
of2.0. 



25 



In a further preferred embodiment of the invention, the cut-off fold change resistance 
value is calculated by constructing a classification tree in order to classify the 
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likelihood of a patient having an undetectable pathogen load aflter treatment with a 
particular drug, as a success or a failure. This methodology constructs ttee-structured 
rules in order to classify patients as successes (undetectable pathogen load after 
treatment) and failures. For example, for a virus an undetectable pathogen load could 
5 be defined as a viral load of less than 400 viral copies per ml. Such a classification tree 
has the advantage that it is very visual and easy to interpret, although it suffers firom the 
limitation that the decisions do not take into account the value of certain other relevant 
parameters- Imbalances for such parameters may therefore influence the decision taken 
for a certain parameter. However, such trees provide insigihts into the importance of 
1 0 several parameters and this can be helpftd in the fitting process of the linear regression 
and logistic regression approaches described above. 

The classification tree poses queries, in which the answer to each query results in either 
the left or the right branch of the tree being taken at each stage. For example, the first 
query may preferably consider the fold change resistance of the pathogen genotype to 

15 the drug in question e.g. is fold change for the drug TDF (tenofovir) < 135? If yes, the 
left branch is taken, if no the right branch is taken. As with Ihe methods of the aspects 
of the invention described previously, the other &ctors queried include the log baseline 
pathogen load and the phenotypic sensitivity score. The numbers at the termini of the 
final branches represent the response rate (1 = 100% response). Examples of 

20 classification trees according to the invention are provided in Figures 8a, b and c. 

In this ^bodiment of Iffiib invention, the clinical cut-off is defined as the fold change ""^"^ 
resistance threshold value that makes the best distinction between successftil and 
unsuccessftil treatments i.e. the most suitable value posed in the query that bifurcates 
the tree into the left and right branches. The population is thus split into two subgroups: 
25 one with a high success rate and one with a low success rate. The clinical cut-off is 

chosen as the fold change that makes the difference between the two groups as large as 
possible. 

Preferably, two or all three of the methods of the above-described embodiments of the 
invention are performed for each dataset and candidate drug. The clinical cut-offs can 

30 in this manner be calculated for each of the approaches. From the analysis results, the 
most appropriate values for lower and higher cut-offe are selected, taking into account 
the advantages and the disadvantages of the separate approaches. This selection will 
only be made if the results of the approaches are consistent or if possible 
inconsistencies can be explained, ff there are unexplained inconsistencies between the 

35 results, it can be concluded that more data need to be gathered before a clinical cut-off 
can be determined. 
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For exainple^ if the results of the different approaches are consistent (preferably clinical 
cut-off difference < 0.5) then the predictions are deemed to be consistent. If the results 
differ more than that, the disparities need to be explained. For example, if we suppose 
that the population contains 90% censored values and the linear regression gives a 
5 clinical cut-off of 0.9 while the logistic regression gives a clinical cut-off of 3.5, then in 
this example the linear regression results are less reliable because too much correction 
has to be made for censoring and there is too little information contributed by 
"complete" observations. 

The models may be validated using bootstrapping or by repeating the described steps 
10 several times. 

Alternatively, the model n^y be validated by calculating a concordance index (c-index) 
(Hatrell F.E., Lee K.L. and Mark D.B. - Multivariable prognostic models: issues in 
developing models, evaluating assumptions and adequacy, and measuring and reducing 
errors - Statist. Med. 1996; 15:361-387) which describes how all flie models can 
1 5 discriminate between patients with a different response. 

This c-index may be calculated on the data set used for model development and on a 
further test data set. If the difference between the two c-indices is small, it means that 
the models do not lose their predictive ability if applied to new data. 

Further factors which may be taken into accoxmt when validating the model are the 
20 odds ratios determined using clinical cutoffs coxrpared to those determined using 

biological cutoffe. These ratios represent the odds of being a responder in the group that 
is labeled resistant by the cutofis divided by the odds of being a responder in the group 
that is labeled sensitive by the cutoff. In this way it is possible to evaluate how well 
the cutoffs perform on a drug by dmg basis. The fiirther away that the odds ratio is 
25 from 1, the stronger the correlation between the resistance class and climcal outcome. 
The odds ratio as specified above yield numbers smaller than 1 as the odds of being a 
responder should be smaller in the higher resistance class. Put another way, the 
probability of response will decrease as the resistance increases. Additionally, if the 
odds ratio for the CCO is smaller than the odds ratio for the BCO, it can be concluded 
30 that the CCO is stronger correlated with clinical outcome and therefore gives a better 
prediction than the BCO. 

It is also possible to study the difference in response rate for patients taking 1 active 
drug more vs. patients taking one active drag less. This, different type of odds ratio is 
the ratio of odds of response for people taking more active drags over the odds for 
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people taking less active drags. In Ibis case, the odds tatio should be > .1 as the 
probability of response will increase as the number of active drugs taken increases. In 
this case, a larger odds ratio indicates a stronger correlation with clinical outcome. In 
one dataset, using the CCOs set out on page 36 herein, the odds ratio for response per 
5 additional active drag added was 3,01 when calculated using clinical cutofife, and 2.32 
when calculated using biological cutoffs. 

A logistic regression model may be used to determine the odds ratio. The model used is 
the same as the logistic regression model desoibed above except that the fold change in 
the model is replaced by the resistance class. The coefficient for the resistance class 
10 that is obtained from the model is the log (odds ratio). The advantage of using this 

model is that Hie odds ratio estimates can be adjusted for the baseline viral load and the 
cPSS score of the background regimen. 

A number of embodiments are shown in the following table which gives values for 
lower and upper cutof & with confidence intervals modelled using defixution 2 and a 
15 linear regression model for several drugs, and validated using C-indices, and CCO odds 
ratios. 



Drug 


Modelled Lower Cutoff & 
95% Confidence Iniaval 


Modelled Upper Cutoff & 
95% Confidence Interval 


AZT 


1.9 [1.52-2.76] 


14.4 [8.24 -21.20] 


3TC 


1.1 [0.98-1.39] 


''^ 3.7 [1.71-11.44] 


D4T 


1.1 [1.05-1.12] 


2.2 [2.05 - 2.30] 


DDm 


1.3 [1.07-1.34] 


3.0 [2.59 - 2.967 


ABC 


0.S [0.75 -1.72] 


1.2 [1.19-5.11] 


TDF 


1.0 [0.97 -1.32] 


2.0 [1.51 -2.95] 


NVP 


1.5 [1.40- 16.74] 


3.2 [2.22 -63.06] 


EFV 


1.8 [1.41 -3.74] 


29.2 [6.46-146.69] 


IDV 


0.8 [0.77-1.04] 


2.2 [1.33-7.19] 


IDV/r 


4.1 [0.77-6.24] 


21.2 [1.41-22.86] 


APV 


0.7 [0.65 - 0.87] 


1.4 [1.03 - 5.55] 


APV/r 


0.9 [0.80-2.72] 


6.5 [4.06- 16.22] 


NFV 


1.0 [0.97 -1.03] 


1.5 [1.54 - 2.38] 


SQV 


0.7 [0.65 -2.28] 


1.0 [1.03 -22.61] 


SQVA- 


1.1 [0.81-5.98] 


12.0 [4.50 -28.85] 


LPV/r 


10.3 [1.53-17.30] 


61.6 [21.92 - 66.96] 
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The methods of the invention can be repeated for each possible drag or theiapeutic 
agent known or suspected to be associated with disease resistance, or towards which a 
resistance can be expected to appear. As such, according to another embodiment of the 
invention, the clinical cut-oflEs generated can be presented as a list of cut-oflfs against or 
5 in respect of individual drugs or individual therapeutic agents, for each pathogen. 

As used herdn, the term "drag" includes, but is not limited to, a pharmaceutical, 
bactericide, fungicide, antibiotic, or anticancer, antiviral, anti-bacterial anti-ftmgal, 
anti-parasitical or any other compound or composition that can be used in therapy or 
therapeutic treatment 

10 A "patient" may be any organism, particularly a human or other mammal, suffering 
from a disease or in need or desire of treatment for a disease. A patient includes any 
mammal, including farm animals or pets, and includes humans of any age or state of 
development A group of patients useful to establish treatment response as a function of 
the distribution of fold change resistances may be as low as 10 to 50 patients, 50 to 500 

15 patients, or, more preferably, will comprise a population of 500 or more patients. The 
distribution fold change resistances can be a normal distribution (Gaussian distribution) 
or can be a non-normal distribution. The non-normal distribution may be transformed 
to obtain a normal distribution. 

The patient samples may be from treatment naive or tceattnent experienced subjects, 
20 with or without resistance to one or more drags. 

As used herein, the term "disease" refers to a disease caused by infection with a 
pathogen. The term "pathogen", as used herein, is used broadly and refers not just to 
pathogenic microorganisms, but includes any disease-causing agent Examples include 
bacteria, vimses such as human immunodeficiency virus (HIV), hepatitis C (HCV) or 
25 hepatitis B (HBV), prions, algae, fungi, protozoa and malignant cells. This invention is 
particularly usefid for viral diseases such as HIV. 

A "patient sample" is herein defined as any sample obtained from an individual 
suffering from or predicted to be suffering fix>m a disease caused by a pathogen, and 
mcludes tissues such as blood, serum plasma, urine, saliva, semen, breast milk, feeces, 
30 mucous samples, cells in cell culture, cells which may be further cultured, biopsy 

samples and so on. In one embodiment, for a patient infected with HIV, any biological 
sample-containing vims may be used. Of this patient sample, the pathogen itself may 
be used or alternatively a protein, or nucleic acid derived from the pathogen. 
Preferably, the pathogen is a virus, such as a retrovirus. Preferably the biological 



wo 2005/086061 



PCT/EP2005/050888 



-19- 

sample contains a virus chosen ficom HIV, HCV (Hepatitis C Virus) and HBV 
(Hepatitis B virus). In another embodimenl^ for a cancer patient, the patient sample 
may contain cells, tissue cells, mutated cells, malignant cells, cancer cells, whole or 
partial tumours, biopsy tissue, etc. Preferably, the pathogen is a malignant cell. A 
5 "reference sample" is defined as a standard laboratory reference pathogen such as, for 
example, in the case of HIV, the HIV LAI TTTB strain. One strain generally used as the 
reference ''wild type" sequence for HTV is HXB2. This viral genome comprises 9718 
bp and has an accession number in Genbank at NCBI M38432 or K03455 (gi number: 
327742). Reference or wild type sequences for use in liie invention in the field of 
10 specific diseases, infections or diseases caused by specific pathogens can be easily 
obtained fix>m publicly available databases. 

"Susceptibility" or "sensitivity" to a dmg refers to the capacity of the disease, and/or 
pathogen to be affected by the dmg. "Resistance" refers to the degree to which the 
disease and/or pathogen is unaffected by the dmg. The sensitivity, susceptibihty or 

15 resistance of a disease towards a drag may be e:q)ressed by means of an ICso value. The 
IC50 value is the concentration at which a given drag results in a reduction of the 
pathogen's growth compared to the growth of the pathogen in the absence of a drag. 
Resistance of a disease to a drag may be caused by altemtions in phenotype or 
genotype. Genotypic alterations include mutations, single nucleotide polymorphisms, 

20 microsatellite variations, and/or epigenetic variations such as methylation. Phenotypic 
variations may be effected by genotypic variations or by post-translational 
modification. 

Any method capable of measuring changes in the ability of a pathogen to grow in the 
presence of a drag(s) can be used in the method of the present invention. Such methods 
25 of phenotyping include all methods known to persons of skill in the art Known 
genotyping methods may also be applicable. 

For example, and by way of illustration, methods for phenotyping bacteria suitable for 
use in the present invention include, but are not limited to, measurement of inhibitory 
zone diameters (see, e.g., Guoming et al. Sex Transm, Dis. 27 (2) : 115-8 (2000)), 
30 colorimetric indicator methods (see, e.g., Lozano-Chiu et al, Diagn Microbiol Infect 
Dis. 1998 Jul;31(3):417-24X and broth macrodilution method (see, e,g., Iwen et aL, J. 
Clin. MicrobioL 34 (7) : 1779-83 (1996)). 

As an additional illustrative example, methods for phenotyping pathogens suitable for 
use in the present invention include, but are not limited to, plaque reduction assays, 
35 PBMC p24 growth inhibition assays (see, e.g., Japour et al., Antimicrob Agents 
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Chemofher. 1993 May;37(5): 1095-101; Kusumi et aL, J, ViioL 66 : 875-885 (1992)), 
recoxribinaiit virus assays (see, e.g., KeUam & Larder, Antimicrob. Agents Chemother. 
38 : 23-30 (1994); and Pauwels et al, 2nd Intemational Workshop on HIV Drag 
Resistance and Treatment Strategies, Lake Maggiore, Italy. Abstr. 51 (1998)); the use 
5 of GFP as a marker to assess the susceptibility of anti-viral inhibitors (Marschall et aL, 
Institute of Clin, and MoL ViroL, University of ErlangerNuremberg, Schlobgarten, 
Germany); and cell culture assays (Hayden et al^ N. Eng. L Med. 321 : 1696-702 
(1989)). 

Though the invention may be used with any phenotype or genotype measuring test or 
assay that determines resistance, the following descriptions are designed to describe 
further possible applications of the invention. 

In one embodiment, the clinical cut-off values may be used in concert with direct 
phenotype assays, for example, Antivirogram™ (Virco, Inc.; WO 97/27480, US 
6,221,578). This assay is aphenotypic resistance assay that measures, in controlled 
laboratory conditions, the level of resistance of the HTV derived from an individual 
patient to each of the anti-HTV drugs currently available. The resistant 'Tjehaviour" of 
the vims may be the combined result of the effects of many different mutations and the 
conqplex interactions between them, including genetic changes that have not even been 
identified yet In other words, it is a direct measure of resistance. 

The^test provides a quantitative measure of viral resistance to all the available drags. 
This is expressed in terms of the IC50. This is then compared to the IC50 for ftilly 
sensitive, non-mutated *^vild-type" virus. The resistance of the sampled vims to each 
drag is then expressed in terms of a fold-change in IC50 compared to wild type. 

The addition of ^'clinical cut-o£&", as described in this application, to the report enables 
physicians to identify the drag(s) that are no longer clinically active and helps in the 
selection of the optimal combination of drags for the individual patient,.In one 
embodiment, the method of the present invention concerns a diagnostic tool for 
determining the resistance of a patient to at least one HIV drag comprising the clinical 
cut-off fold change resistance value for said at least one drag as determined herein. The 
diagnostic tool includes phenotypic resistance tests such as the Antivirogram®, 
VirtualPhenotyping® andPhenosense. 

The invention includes methods to determine resistance towards HIV compounds such 
as tenofovir, lopinavir, and those compounds disclosed in W099/67417, EP-A-945443 
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and WOOO/27825. Other examples of drugs will be well known to those of skill in the 
ait. 

In one embodiment the effect of drugs on HBV may be monitored using technologies 
such as disclosed by Isom et at (WO 99/37821, Delaney et al Antimicrob. Agents 
5 Chemotherap- 2001, 45 (6) 1705-1713)- 

In one embodiment the effect of drugs on HCV towards therapy may be determined 
using techniques such as described by Rice (WO 97108310, WO 98/39031) and 
Barthenschlager (EP 1043399). 

The piimaty aim of the invention is to predict the resistance of a disease to a particular 
10 dmg. In addition, however, the invention encompasses methods of evaluating currently 
applied dmgs and thus monitoring these drugs with a view to assessing the 
effectiveness of that drug and proposing altemative dmg(s) or optimizing the dmg if 
deemed appropriate. Such methods involve obtaining a sample containing a disease- 
causing pathogen from a patient, and then performing the steps described in any one or 
15 more of the embodiments of the invention described above. 

It will be apparent to the skilled reader tihiat while the invention has been described in 
the below examples with respect to virases, particularly HTV, the present invention has 
broad applicability to any disease state where it is desired to correlate genotypic 
information with phenotypic profiles and assess the threshold at which a fold change 

20 resistance is clinically significant One skilled in the art could readily take the 

Allowing discussion of the invention with the HIV virus and through the exercise of 
routine skill apply this invention to oth^ diseases (such as other viral infections, 
malignant cells, cancer, bacterial infections, other pathogens, and the like) to correlate 
genotypic information to predict phenotypic response, assess dmg resistance, and 

25 eventually develop a treatment regime of dmgs for a particular patient One skilled in 
ttie art will also know that many virus species comprise many strains; for instance, HIV 
comprises HIV-2 in addition to HTV-l and both groi:^s are further divided into groups 
(such as groups O and M for HIV-1). 

The above methods are diagnostic methods. Further asfpects of the invention provide 
30 diagnostic kits for performing any one of the diagnostic methods of the invention 
desoibed above. The invention ftirther relates to a diagnostic system as herein 
desodbed for use in any of the above described methods. 

According to yet another embodiment, the present invention relates to a diagnostic 
system fox predicting clinical response to a dmg of a disease causiag pathogen 
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conprising: a) means for obtainiag a genetic sequence of the disease producing 
pathogen; b) means for identifying at least one mutation in the genetic sequence of the 
disease producing pathogen; c) genotype database meaas comprising genotype entries; 
d) phenotype database means comprising phenotypes of patient fold change response 
5 values; e) clinical response database means comprising clinical response to dmg 
treatment data for reference saxnple patients; f) correlation means correlating a 
genolype entry with a phenotype, where the genotype entry corresponds with the 
obtained genetic sequence of the disease producing pathogen; g) means for modeling 
clinical response to a drug of the disease causing pathogen by determining whether Ihe 
10 patient fold change response is above a cut-ofif value^ wherein the cut-off value is 

determined using the clinical response database means and comprises the fold change 
response value at which a clinically relevant diminished clinical response is observed; 
and h) means for predicting the clinical response to a drug of a disease by determining 
whether the patient fold change response is above the cut-off value. 

15 As described above, the cut-ofif value is determined as a function of treatment response 
data in treated subjects, considering baseline pathogen load, baseline fold change 
resistance, baseline activity of co-administered dmgs targeted to the pathogen, and 
treatment history. The means for predicting Uie resistance are preferably computer 
means. 

20 A still finther aspect of the invention relates to a coxnputer apparatus or computer-based 
system adapted to perform any bne of the methods of the invention described above. 

In a preferred embodiment of the invention, said computer apparatus may comprise a 
processor means incorporating a memory means adapted for storing data; means for 
inputting data relating to the genotype exhibited by a particular disease causing 
25 pathogen; and computer software means stored in said computer memory that is 

adapted to perform a method according to any one of the embodiments of the invention 
described above and output a prediction of the resistance of a disease causing pathogen 
toward a dmg. 

A computer system of this aspect of the invention may comprise a centml processing 
30 unit; an input device for inputting requests; an output device; a memory; and at least 
one bus connecting the central processing unit, the memory, the input device and the 
output device. The memory should store a module that is configured so that upon 
receiving a request to model the response to a drug of a disease causing pathogen, it 
performs the steps listed in any one of the methods of the invention described above. 
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In fhe apparatus and systems of these embodiments of the invention, data may be input 
by downloading the data from a local site such as a memory or disk drive, or 
alternatively from a remote site accessed over a network such as the intemet. Data may 
be input by keyboard, if required. 

5 The generated results may be output in any convenient format, for example, to a 

prmter, a word processing program, a graphics viewing program or to a screen display 
device. Other convenient formats will be ^parent to the skilled reader. 

The means adapted to predict the resistance of a disease causing agent to a drug will 
preferably comprise computer software means. As the skilled reader will appreciate, 
10 once fhe novel and inventive teaching of ibe invention is appreciated, any number of 
different computer software means may be designed to implement this teaching. 

According to a still further aspect of the invention, there is provided a computer 
program product for use in conjunction with a computer, said computer program 
con^jrising a computer readable storage medium and a computer program mechanism 
15 embedded therein, the computer program mechanism comprising a module that is 

configured so that upon receiving a request to predict fhe resistance of a disease to a 
drug, it performs fhe steps listed in any one of fhe methods of fhe invention described 
above. 

The invention further relates to systems, computer program products, business 
20 methods, server side and client side systems and methods for generating, providing, and 
transmitting fhe results of fhe above methods. 

The invention will now be described by way of example with particular reference to a 
specific system that implements the process of fhe invention. As the skilled reader will 
appreciate, variations firom this specific illustrated embodiment are of course possible 
25 without departing fl-om the scope of fhe invention. 

Brief description of the Fibres 

Figure la: Example of the structure of a clinical data base used in the present 
invention. 

Pigure lb: Example analysis dataset for d4T. 
30 Figure 2a: Example of linear regression curve showing censored and uncensored 
observations, where log viral load drop is modelled as a function of 
baseline fold change resistance. 
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Figure 2b: Rsiample of linear regression curve of 8 week viral load lesponse as a 
function of baseline phenotypic resistance for nucleoside(tide) RT 
inhibitors. 

Figure 2c: Example of linear regression curve of 8 week viral load response as a 
5 fiinction of baseline phenolypic resistance for boosted and unboosted 

protease inhibitors. 

Figure 2d: Example of change in viral load vs, baseline fold change for the NNRTI of 
neviripine (MVP). 

Figure 2e: Example of change in viral load vs- baseline fold change for e&virenz 
10 (EFV)- 

Figure 2f: Drug Effect plotted as % response as a Function of Baseline Resistance for 

nucleoside(tide) inhibitors 
Figure 2g: Drug Effect plotted as % response as a Function of Baseline Resistance for 
boosted and un-boosted Protease Inhibitors 
15 Figure 3a: Example oflinear regression curve for TNF^ where log viral load drop is 
modelled as a function of baseline fold change resistance and a first 
definition of clinical cut-off is applied. 
Figure 3b: Ebcample of linear regression curve for d4T, where viral load drop is 

modelled as a function of fold change and a second definition of clinical 
20 cut-off is ^^lied. 

Figure 4a: Ex:ample of linear regression curve for TDF, where log viral load drop is 
^y- modelled as a function of baseline fold change resistance and a second 
definition of clinical cutnofif is applied. 
Figure 4b: Example of lower and upper cutoffs determined using definition 2 for 
25 lopinavir/r if viral load is modeled using linear regression. 

Figures 4c: Example of lower and upper cutoffi determined using definition 2 for AZT 
if viral load is modelled using linear regression. Curve shows Ihe change in 
viral load vs. the fold change. 
Figure 4d: Example of lower and upper cutoffs determined using definition 2 for AZT 
30 if viral load is modelled using linear regression. Curve shows the % loss of 

reference response vs. the fold change. 
Figure 5a: Example of logistic regression curve for TNF, where probabiUty of feilure 
is modelled as a function of baseline fold change resistance and a second 
definition of clinical cut-off is applied. 
35 Figure 5b: Example of lower and upper cutoffe determined using definition 2 for 
lopinavir/r if the j^lure rate is modelled using logistic regression. 
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Figure 6: Example of linear regression curve for TNF, where log viral load drop is 

modelled as a function of baseline fold change resistance and a third 

definition of clinical cut-off is applied. 
Figure 7: Example of logistic regression curve for TNF, where probabihty of feUure 

is modelled as a function of baseline fold change resistaace and a third 

definition of clinical cut-off is applied 
Figure 8a: Example of classification tree for TNF. This gives results of the same order 

as the linear and logistic regression methodologies. 
Figure 8b: Example of classification tree for 3TC- 

Figure 8c: Example of classification tree for TDF assuming that the cost of classifying 
a :&iled regimen as a success is 1.5 times the cost of classifying a successfiil 
regimen as a &ilure. 

Figure 9: Example of initial validation of preliminary VirtualPhenotype™ Clinical 

Cutofis for nuclesides(tides) and boosted and unboosted protease inhibitors 
by bootstrapping with 90% confidence intervals. 

Example: Process descilptloa of the determination of clinical cut-offs 
Step 1: Clinical DataBase and Analysis Data Set 

Databases of studies for patients with tenofovir containing regimens and consisting of 
patient baseline demographic characteristics, clinical outcome results with viral load 
and resistance data (Fold change), were retrieved and remapped according to a common 
structure allowing a meta-analysis. The structure consisted of baseline sequence, viral 
load data set, viral load measurements and sampling dates (for example viral load 
wifhiti 3 months of starting new regimen and viral load assessment 8 and/or 24 weeks 
after beginning new regimen), CD4+ data set which contains CD4+ counts and 
sampling dates^ resistance data set containing the fold changes to different antivirals 
and sampling dates; patient data set with patient information such as age, gender, race, 
treatment history; treatment data set with dmg regimens, start and stop dates, doses, 
formulations, frequency of intake, reghnen changes after resistance tests. The structure 
of such a clinical data base can be seen in figure la. The following table shows an 
example of characteristics of analysis datasets (8 week outcome) for individual dmgs. 





Range (Drug) 


Median Baseline Vital Load (log) 


3.32 (TDF) - 4.71 (boosted IDV) 


Median background cPSS 


1.34 (ddC) - 2.58 (LPV/r) 


# regimens including the drug 


24 (unboosted APV) - 1551 (3TC) 


% firom cohort data 


21% (unboosted APV) - 83% (ddl-EC) 


% with no resistance mutations 


14.5% (boosted APV) - 75% (EFV) 
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Figure lb shows an example analysis dataset for d4T. The viral load response data 
includes censored (<400 copies/ml) values. Parameters of the underlying uncensored 
distribution were estimated by maximum likelihood estimation in S AS (proc lifereg). 
5 The viral load response data are corrected for baseline viral load and cPSS. 
St^ 2: Modelling 

The clinical outcome results (drop in viral load and response rate) were modelled as a 
fimction of baseline fold change (FC) as determined by virtual phenotype (see 
WOOl/79540 and WO02/33402; also ht^://www.virco]ab-com). The models applied 
10 were linear regression, logistic regression, and a classification tree. These models also 
took into account effects of the concomitant HIV dmgs (PSS), baseline viral load 
(Baseline Log(Vl)i) and, optionally, treatment history in order to avoid bias introduced 
by imbalances of important characteristics. From the models, a prediction of clinical 
outcome could be made at different levels of the baseline fold change resistance, 

IS In the linear regression model, the proposed equation was Ibe following: 

LogVL drqpi = pO + pl*Baseline Log(Vl)i + P2 *PSSi + p3 *(l/FC)i + 

where i represented the patient, PO the intercept, pi, P 2 and p3 coefficients indicated 
the increase in log viral load drop per unit increase of respectively the baseline log VL, 
the number of sensitive dmgs in the background regimen and the inverse of the 
20 baseline fold change, was a random error term indicating the deviation of the patient 
i!t>fi:om the value predicted by the model. Interactions between all the lectors were 
evaluated and other baseline characteristics, i.e treatment history, were added if 
relevant. After applying the regression model, the curve as depicted in Figure 2a was 
obtained. 

25 Example curves showing linear regression models of 8 week viral load response as a 
function of baseline phenotypic resistance are shown in figures 2b and 2c. The curves 
, shown are for subjects with a log baseline viral load = 4.0 and a cESS score for the 
background regimen of 2,0, Figure 2b shows models for nucleoside(tide) RT inhibitors 
(firom top to bottom at FC=5, the respective curves represent d4T; TDF; ddl, EC; ABC; 

30 3TC and AZT). Figure 2c shows models for boosted and unboosted protease inhibitors 
(from top to bottom at FC=10 the respective curves represent: top left graph - NVP and 
LPV/r; top right graph - IDV and IDV/r; bottom right graph - SQV/r and SQV; bottom 
left graph - APV/r and APV) . 

A fimdamental issue with modeling clkdcal outcome for non-nucleoside(tide) RT 
35 inhibitors is that the baseline fold change may have litfle effect on treatment response 
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to current NNRHs in NNRH experienced patients. For the NNRTI of neviripine^ 
polarisation of fold change values is observed (see figure 2d). Purthermore, an 
extremely broad dispersal of fold change values is observed for efavirenz (see 
figure 2e). 

5 In a preferred embodiment of the linear regression model, more &ctors are included in 
order to obtain a more refined prediction of viral load response. These fectors include 
for example a sensitivity score per drug class in addition to the overall sensitivity score 
of the background treatment (cPSS), previous exposure to the drug (mSve (Yes/No), 
naive to Pis, naive to NRTPs, etc, ,-)- Furthermore, in the preferred embodiment, the 
10 fold change is transformed before inserting the figures into the modeL The 

transformation to the fold change comprises a power-transformation ranging firom FC"^ 
to FC^. In addition, a quadratic term in cPSS is preferably added. 

Accordingly, a more general form of the equation given above may be repressed as: 

wherein p is a power transformation (e.g. ranging from -3 to 1) and H5 to Ha are 
treatment history parameters (e.g. naive to antiretroviral therapy, naive to NRTI 
treatment, etc. or parameters describing the background therapy as a ftinction of a 
20 certain therapeutic class (e.g. the number of active NRTPs taken concomitantiy with the 
drag under investigation). 

tv 

Example curves showing linear regression models as calculated using this model are 
shown in Figures 2f and 2g. Here, the presentation of the data is different in that the % 
response is plotted, as calculated for the preferred CCO definition, rather than the viral 
25 load drop. 

In the logistic regression, the proposed equation was the following: 
Probof 

^^(Po + ^^LogiBaselimVL,)-\-^^iPSS,)+%i\IFC,)) 
(H-exp((io -t- ^^LogiBaselineVL,)^^:,iPSSi)^^^ilJ FC,))^^ 

30 

where pi, p2, and p3 represented the log odds ratio of success for the corresponding 
factors in the model. After applying the logistic regression model, the curves as 
depicted in Figures 5a and 7 were obtained In the classification trees model, tree- 
structured rules were constructed in order to classify patients in successes (undetectable 
35 viral load after treatment) and failures. The same parameters as for the other techniques 
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were considered. The tree shown in Figure 8 was obtained after applying the 
classification tree model 

When viral load results under the detection limits are obtained, biases could be 
introduced if the detection limit values are considered when calculating viral load drops 
5 and using those in the linear regression model. To avoid this, censoring needed to be 
taken into account and therefore the PROC UFEREG fecility in the SAS paciage was 
en^Ioyed. 

An advantage of this regression model is that it takes into account the maximum 
amount of information present in the data, i.e. correlating specific clinical responses 

1 0 with specific Fold changes while the oOxer two models clusters the patients in two 

groups (successes versus &ilures), thus not taking into account differ^ces in responses 
within the same group. Estimates are corrected for covadates in the model (e.g, 
background regimen) and therefore, they do not sniffer firom imbalances in the 
covariates. Conclusions are limited to patients with covariates that are represented in 

15 the clinical database. 

Logistic regression does not suffer ftom the censoring problem and the probability of 
success is an intuitive way of interpreting clinical outcome. However, by binning the 
viral load iato successes and failures, part of the information of the continuous variable 
viral load is lost. Estimates are also corrected for covariates as for linear regression. 

20 Classification trees are very visual and easy to interpret, but they have the disadvantage 
that the decisions do not take into account the value of other relevant parameters. This 
implies that imbalances lEbr other parameters may irdEIuence the decision taken for a 
certain parameter. However, they provide insights in the importance of several 
parameters and this can be helpftd in the fitting jxrocess of the other approaches. 

25 Figures 8a, 8b and 8c show examples of classification trees. Figure 8b shows a 
classification tree for 3TC, whilst figure 8c shows a classitication tree for TDF 
assuming that the cost of classifying a failed regimen as a success is 1.5 times the cost 
of classifying a successful regimen as a &ilure. 

30 Step 3: Determination of the clinical cut-offi 

Clinical responses were predicted in the models developed in previous step 2. The 
approach is to identify two cutojBfe per drug: a "lower^' cutoff which represents the fold 
change at which the response begins to be lost, and an *\ippef'' cutoff which represents 
the fold change at which the response is essentially gone. In order to determine the fold 
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chaages at which climcaQy lelevant diminished climcal responses can be observed, 
three definitions of clinical cut-o:l% were considered: 

Definition 1: 

Sensitive: predicted viral load drop is more than 0.6 logs. 
5 Xntermediate resistant: predicted viral load drop is between 0,2 and 0,6 logs. 
Resistant: predicted viral load drop is less than 0.2 logs. 

The lower and higher cutoffs are defined as the fold change with expected log viral 
load drops of >0.6 and <0.2 respectively. 

This definition of clinical cut-off addresses &e potency^ of an entire combination 
10 regimen and is highly dependent on the characteristics of the specific patient regimens 
analysed 

Definition 2: 

The maximum effect was defined as the treatment effect at fold change 1^ and the 
minimum effect was defined as Ihe treatment effect at a very high fold change (i.e. 
1 5 when the curve reached a plateau). The effect range was then the difference between 
the maximum effect and the minimum effect. 

Sensitive: the predicted treatment effect is more than 80% of the effect range. 

Intermediate resistant: the predicted treatment effect is between 20% and 80% of the 
effect range. 

20 Resistant: the predicted treatment effect is less than 20% of the effect range. 

The lower and higher cutoflfe are defined as the fold change associated with an 
expected 20% and 80% decrease respectively of the reference activity of the drug 
within the regimen. 

Cutoffs obtained using definition 2 do not address the potency of the entire treatment 
25 regimen, but rather give an estimation of the activity of the drug within the regimen. 
The absolute magnitude of the viral load drop depends on specific covariates. 

Definition 3: 

Definition 3 was a variant of definition 2. The lower cut-off was defined as the fold 
change that most optimally distinguished patients between successful and unsuccessful 
30 treatments. 

Using definition 3, breakpoints detennined by classification trees are applicable only to 
a subset of patients unless fold change is selected at the first tree node. Classification 
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trees ate easy to intexpret but the cutoff do not indicate the magnitude of the vital load 
teduction expected &r the whole regimen or the drug within the regimen. Breakpoints 
determined by the linear and the logistic regression models are close to the lower 
cutofis as defined by definition 2, except for the boosted protease inhibitors. 

S The methodology was applied fox Tenofovir on a population taking two active drugs 
besides tenofovir and with a baseline Log(Vl) of 4. 

When we applied definition 1 on the linear regression model, the observed drop in log 
viral load was -0.6 at fold change 3.73 (fig 3a). No higher cut-oiOr could be derived as 
this population experienced a drop in Log(VL) greater then 0,2 even with a high 
10 baseline fold change for tenofovir. This could be explained by the effect of the active 
background regimen in this population. 

When we applied definition 2 on the linear regression model (fig 4a), the observed drop 
in log viral load was -1 .48 at fold change 1, and -0.28 at the assymptotic fold change. 
Therefore the effect range was -0,28 + L48 =1 .2. 

15 20% of this effect range was observed at fold change 5 (and this value was considered 
as the upper clinical cut-off value). 

80% of the effect range was observed at fold change 1.25 (and this value was 
considered as the lower clinical cut-off value). 

To predict the resistance according to this regression model, we determined whether the 
20 patient fold change resistance was above, below, or in between the clinical cut-off as 
calculated according to definition 2. So, when the FC of patient was of 0,8 (below the 
lower clinical cut-off), a normal clinical response was predicted. If the FC of the 
patient was of 2 (above the lower clinical cut-off and below the upper clinical cut-off), 
a reduced clinical response was predicted. If the FC was of 7 (above the clinical cut- 
25 off), then the clinical response was ptedicted as being minimal. 

Definition 2 was also applied to the logistic model (fig, 5a) and this resulted in a lower 
cut-off at 1.2 FC and a higher cut-off at 3,81 FCThe results for definition 3 are 
depicted in fig 6 and 7. 

The Tenofovir results for the population with 2 active drugs in the regimen and a 
30 baseline Log(VL) of 4 are summarized in the Table below. 

From the Table, it can be derived that the lower cut-off for definition is 1 .2 and the 
higher cut-off ranges from 3.81 to 5 for the population of patients taking 2 active drugs 
besides tenofovir and with a baseline log(VL) of 4. The variation in cut-offs determined 
by the different definitions is a tesult of the different influence of the covariates such as 
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PSS and log VL. That is, the mfluence of the covariates is sigmficaat when using 
definition 1 and less significant when using definition 2. 

Definition 1 can only be applied on the linear regression model The clinical cut-offs 
determined using definition 1 are highly dependent on the characteristics of the 
5 subpopulation. This is due to the fact that definition 1 describes the potency of the 

whole drag regimen while definition 2 is related only to the activity of the drag under 
consideration and its resistance profile. In other words, the activity of the background 
regimen together with the drag under investigation determines the viral load drop that 
the patient will experience and hence the dependence of the cut-off on the background 
10 regimen. The activity of the background regimen does not change the resistance profile 
in a profound way, therefore the clinical cut-offs do not vary considerably with the 
population characteristics. 









Linear 


Logis0c 


CiasslficaUon 








Regre 


iSSfOfI 


Regn 


ission 


Tree 


Definition of 


















Clinical 






Lower 


Higher 


Lower Higher 


Lower Higher 


cut-off 


Population 


Properties of the subgroup 


CO 


CO 


CO 


CO 


CO 


CO 


Deffnition i 


Subgroup 1 


PSS^Z baseline Log(VL) = 4 


3.73 


> assay 
limit 


NA 


NA 


NA 


NA 




Subgroup 2 


PSS=0, baseline Log(VL) = 4 


1.68 


3.8 


NA 


NA 


NA 


NA 




Subgroup 3 


PSS-2, baseline Log(VL) = 5 


> assay 
limit 


> assay 
limit 


NA 


NA 


NA 


NA 




Overall 




NA 


NA 


NA 


NA 


NA 


NA 


OeHniiionZ 


Subgroup 1 


PSS~2, baseline Log(VL) = 4 


1-25 


5 


1.2 


3.81 


NA 


NA 




Subgroup 2 


PSS^, baseline Log(yL) = 4 


1.25 


5 


1.16 


3.36 


NA 


NA 




Subgroup 3 


PSS^Z baseline Log(VL) = 5 


1.25 


6 


1.17 


3.4 


NA 


NA 




Overall 




1.25 


5 


NA 


NA 


NA 


NA 


OeHnition 3 


Subgroup 1 


PSS=2, baseline Log(VL) = 4 


1.1 


6 


1.2 


3.81 


1.15 


NA 




Subgroup 2 


PSS=^, baseline Log(VL) = 4 


1.1 


5 


1.2 


3,36 


1.15 


NA 




Subgroup 3 


PSS=2, baseline Log(VL) = 6 


1.1 


5 


1.2 


3.4 


1.15 


NA 




Overall 




1.1 


5 


1.2 


NA 


1.15 


NA 


NA: Not Applicable 



15 Both Imear and logistic regression models give similar results for definition 2. 

When we applied definition 1 on the linear regression model for patients with a log 
baseline viral load of S and all patients taking two active drugs in addition to 
d4T(stavudine), a viral load drop of more than 0.6 log copies/mL for any fold change of 
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d4T is predicted (figure 3b), The viral load drop is predicted to be -0,6 logs and -0,2 
logs at fold changes 2.6 and 4.0 for patients with a log baseline viral load of S and 
taking no active drugs in addition to d4T (see figure 3b). 

Lower and upper cutofiSi determined using definition 2 for lopinavir/r are shown in 
5 figure 4b as 8 and 69 respectively for the whole population if viral load is modeled 

using linear regression, while the lower and upper cuttofs are 11 and 64, 10 and 60, and 
9 and 58 respectively for populations with log baseline VL/background PSS of 4/2, 5/0 
and 5/2 respectively if the feilure rate is modeled using logistic regression (see figure 
5b). The PSS is not statistically significant in this example. 

10 Lower and upper cutoffs determined using definition 2 for boosted saquinavir for the 

logistic model were 1 .7 and 13 .2, and 1 .7 and 12,9 respectively for populations with log 
baseline viral load/phenolypic sensitivity score for the background regimen of 4/2 and 
5/0 respectively. Lower and upper cutoffe by linear regression for saquiavir/r were 1.6 
and 12.3 respectively for Ihe whole population 

15 Lower and upper cutoffs determined using definition 2 for AZT are shown in figures 4c 
and 4d if viral load is modeled using linear regression. Figure 4c shows the change in 
viral load vs. the fold change whilst figure 4d shows the % loss of reference response 
vs. the fold change. 

Further tests with records for >1 3,000 patients yielded '-3 150 regimens with the 
20 required baseline and outcome variables, ranging fix>m 60 regimens including boosted 
SQV soft gel to 1546 iiibluding 3TC. Median log baseline viral load ranged fi-om 3.8 
(regimens with t^ofovir) to 4.7 (regimens with boosed indinavir). Median PSS of 
background regimens was 2 (range 0-7), The following table shows results of 
preliminary FiWwa/Phenotype™ Clinical Cutoffs for nucleosides(tides) and boosted 
25 and unboosted protease inhibitors from a preliminary linear regression analysis. The 
fold change values associated with a 20% diminution of virologic response at 8 weeks 
and an 80% diminution of virologic response compared to the maximal response are 
shown with a 95% confidence interval in square brackets. 



DRUG 


WRTtMLPHENOTYPE™ 
PREDICTED FC OF WILD 
TYPE CUNICAL ISOLATES 


VIROLOGIC RESPONSE 


20% REDUCTION 


80% REDUCTION 


AZT 
zidovudine 


0.8 


1.8 
[1.5-2.5] 


17 
[10-25] 


3TC 
lamivudine 


0.8 


1.1 
[1.1-1.21 


2.6 
[1.9-4.6] 
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w/^toalphenotype"^" 


VIROLOGIC RESPONSE 


DRUG 


PRcDIUTcLI rO Qr WILLI 








TVDC 1^1 IMII^AI ICOl ATPQ 
1 YrC OL-imOMI. loV/L-MI CO 


20% REDUCTION 


80% REDUCTION 


D4T 


0.7 


1.3 


3.4 


sxavuuinB 




[1.2-1.4] 


[3.1-3.6] 


ddl 


0.6 


1.3 


3.6 


aiaanosine 




[1.2-1.9] 


[2.8-4.9] 


(extended release) 










0.6 


1.6 


5.8 


OUOlV^Cl V II 




[1.1-2.6] 


[1.7-7.4] 


TDF 


0.8 


1.2 


2.5 


VSl IVIWII 




[1.1-1.5] 


[1.7-3.8] 


IDV 


0.7 


1.2 


3.4 


11 iVJil lOVIi 




[1.1-1.9] 


[1.9-16.4] 


i\/n/r 




3.5 


25 


Inrlinnvir/r 




[1-1-8.4] 


[1.8-31] 


1^1^ V 


0.9 


1.1 


2.2 


1 IsStt II ICIVII 




[1.1-1.3] 


[1.7-5.3] 




0.6 


1.1 


2.0 


OGl\^UII ICIVII 




[1.1-2.1] 


[1.7-18] 


SQV/r 




1.6 


12 


Saquinavir/r 




[1.3-4.8] 


[5.8-27] 


AMP 


0.6 


1.2 


3.4 


amprenavir 


[1.1-2.4] 


[1.7-10.2] 


AMP/r 




1.5 


6.8 


amprenavir/r 


■■•if 


[1.2-2.6] 


[3.6-10.5] 


LPV/r 


0.8 


6.9 


56 


Lopinavir/r 




[2.1-17.4] 


[29-67] 



Similar values were detemuned in logistic regression models. While the magnitude of 
the virologic response for individual patients is affected by covariates such as viral load 
and PSS, FC values associated with fiactions of the effect range are not 

5 Further test results showed that the clinical cutoffs for 20% reduced response after 8 
weeks were low (LI to 1.2 for unboosted Pis), but higher than Virco type predicted 
fold changes for wild type clinical isolates(0.73 (ABC) to 1.07 (AZT) for NRTIs, 0.63 
(APV) to 0.87 (NFV) for Pis). Clinical cutoffs for 80% reduced response were 
3.4(APV, IDV) for unboosted Pis. Clinical cutoffs for boosted Pis were higher: 1 .5 
10 (APV/r) for 20% reduction and 6.8 (APV/r) for 80% reduction. Among the treatment 

regimens analysed, sensitivity classes delBned by these clinical cutoffs showed different 
rates of virologic response (viral load drop of more than 1,0 log for BQL at 8 weeks) to 
regimens including the drag: 70 to 92% for fold changes less than the lower clinical 
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cutoff, 39 to 68% for fold clianges between the upper and lower clinical cutofi^, and 18 
to 50% for fold changes above the upper clinical cutoff. 

In another example the following linear regression model is used for d4T with clinical 
cutoff definition 2 when Log(BaselineVL) = 4^ lower reference fold change =0.9 and 
5 upper reference fold change=3, cPSS=2^ PSS|lJRTr|=l andNRTI[naive]=0, usmg the 
following linear regression: 

VLdrop = 2.91 - 0.63Log(BaselineVL) - 1.66(FC^'^) - 0.99(cPSS) + 0J5(cPSS^) 

- 0J8(PSS[NRTI]) + 0.91(NRTI[nawe]). 

The NRTI[naive] value represents whether Ihe patient is nmve to nucleside RT 
10 inhibitors. If yes, value =1, if no, value =0, The PSS[NRTI} represents the phenotjrpic 
sensitivity score for NRTIs, i,e the number of active NRTIs in the background regimen 
for the patient. 

Stage 1: Calculate VLdrop at the lower reference FC (drug is folly active) and the VL 
drop at the upper reference FC (minimal activity of the drag). This leads to a lower VL 
15 drop of —2,94 and an ttpper VL drop of —2.03. 

Stage 2: Calculate VL drop when drug has lost 20% of its activity and the VL drop 
when drug has lost 80% of its activity. This gives values of VL drop=-2.76 for 20% and 
-2.21 for 80% loss of activity. 

Stage 3: Determine Fold change equivalent to VL drop at 20% and 80% loss of activity 
20 by inserting values for VL drop into the above equation and calculatmg FC. This gives 
FC values of 1.08 and 2.18. Consequently, the lower and upper clinical cutoff values 
for d4T ate modeled as 1.1 and 2.2 respectively. 

Step 4: Validation of the cut-offs 

The models were validated using bootstrapping and repeating the steps described above 
25 several times. Bootstrapping is a resampling technique in which pseudo-populations of 
the same size as the original population are created by randomly drawing samples from 
the original population. Analysis of each of these populations gives a sense of the 
sampling variability of the clinical cut-off. 

Figure 9 shows initial validation of preliminary F/r^a/Phenotype™ Clinical Cutoffs 
30 for nuclesides(tides) and boosted and unboosted protease inhibitors by bootstrapping 
with 90% confidence intervals. 

The problem is tackled from different points of view in order to assess the robustness of 
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the analysis results. The climcal cut-offs obtained could be further refined by adding 
more data sets and by taking more characteristics of the patients into account The 
clinical-cutoflfe obtained could also be further refined by performing the model on 
unseen data. 

5 An alternative method of validation of the model includes calculation of a concordance 
index (c-index) which describes how all the models can discriminate between patients 
with a different response. This c-index is calculated on the data set used for model 
development and on a test data set. If the difference between the two c-indices is smaU, 
it means that the models do not lose their predictive ability if applied to new data. The 

10 following table contains information regarding the validation procedure for each dmg 
tested when modelled using the preferred embodiment of linear regression which 
includes the additional &ctors of sensitivity dmg per class in addition to the overall 
sensitivity score of the background treatment and previous exposure to the drug (e.g. 
naive, naive to Pis, naive to NRTIs). The test data c-index and the validation data c- 

15 index columns relate to the c-indicies for the original test data set and the new data set 
for validation purposes. The modeled lower and upper values of clinical cutoff are 
quoted with corresponding confidence intervals. 



Drug 


Test Data 


Validation Data 


iUlodelled Lower 
Cutoff & 
Confidence 
Interval 


Modelled Upper 
Cutoff & 
Confidence 
Interval 


C-index 


ceo 

Odds 
Ratio 


BCO 

Odds 

Ratio 


C-lndex 


ceo 

Odds 
Ratio 


BCO 
Odds 
Ratio 


ACT 


0.79 


0.065 


0.107 


0.78 


0.252 


0.316 


1.9 [1.52-2.76] 


14.4 [8.24 - 21 .20] 


3TC 


0.79 


0.160 


0.202 


0.79 


0.265 


0.284 


1.1 [0.98-1.39] 


3.7 [1.71-11.44] 


D4T 


0.77 


0.113 


0.239 


0.76 


0.366 


0.377 


1.1 [1-05-1.121 


2.2 [2.05 -2.30] 


DDIE 


0.73 


0.219 


0.379 


0.71 


0.119 


0.159 


1.3 [1,07-1.34] 


3.0 [2.59 -2.96] 


ABC 


0.70 


0.651 


0,359 


0.69 


0.544 


0.380 


0.8 [0.75 -1.72] 


1.2 [1.19 -6.11] 


TDF 


0.71 


0.228 


0.570 


0.68 


0.226 




1.0 [0.97 -1.32] 


2.0 [1.51-2.95] 


NVP 


0.79 


0.282 


0.290 


0.83 


0.054 


0.064 


1-5 [1.40 -16.74] 


3.2 [2.22 - 63.06] 


EFV 


0.79 


0.079 


0.099 


0.79 


0.041 


0.042 


1.8 [1.41-3.74] 


29.2 [6.46 -146.69] 


IDV 


0.76 


0.098 


0.194 


0.79 


0.270 


0.598 


0.8 [0.77-1.04] 


2.2 [1.33-7.19] 


IDV/r 


0.71 


0.027 


0.122 


0.63 


0.012 


0.038 


4.1 [0.77 - 6.24] 


21.2 [1.41-22.86] 


APV 


0.86 


0.051 


0.061 








0.7 [0.65 -0.87] 


1.4 [1.03 -5.55] 


APV/r 


0.73 


0.005 


0.053 








0.9 [0.80 -2.72] 


6.5 [4.06 -16.22] 


NFV 


0.76 


0.177 


0.291 


0.77 


1.068 


0.938 


1.0 [0.97-1.03] 


1.5 [1.54 - 2.38] 


SQV 


0.77 


0.332 


0.431 


0.56 






0.7 [0.65 -2.28] 


1.0 [1.03-22.61] 


SQV/r 


0.71 


0.165 


0.249 


0.81 


0.054 


0.080 


1.1 [0.81-5.98] 


12.0 [4.50 - 28.85] 


LPV/r 


0-70 


0.074 


0.240 


0.75 


0.002 


0.277 


10.3 [1.53-17.30] 


61.6 [21.92 -66.96] 
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The colunms beaded CCO Odds Ratio and BCO Odds Ratio represent the odds of 
being a responder in the group that is labeled resistance by the cutoff divided by the 
odds of being a responder in the group that is labeled sensitive by the cutoffi. For 
5 example, if there are four non-responders and one responder in the resistant group, and 
four responders and two non-responders in the sensitive group, the odds of being a 
responder in the resistant group is 1:4 = 0.25 and the odds of being a responder in the 
sensitive group is 4:2 = 2. This means that the odds ratio is 0.25/2 = 0.125. In other 
words flie odds of being a responder in the resistant group are 0.125 times the odds of 
10 being a responder ia the sensitive group. The further away that the odds ratio is from 1, 
the stronger the correlation betwe^ the resistance class and climcal outcome. 

The separation in groups (resistance/sensitive) can be done based on climcal cutoffe or 
biological cutoff. This means that odds ratios based on clinical cutoffs (CCO) and 
odds ratios based on biological cuto^ (BCO) can be compared. If the odds ratio for the 
15 CCO is smaller than the odds ratio for the BCO, it can be concluded that the CCO gives 
a better prediction than the BCO. 

In practice, a logistic regression model is used to determine Ihe odds ratio. The model 
used is similar to logistic regression model described earlier except that the fold change 
in the model is replaced by the resistance class. The coefficient for the resistance class 

20 that is obtained from the model is the log (odds ratio). The advantage of using this 

model is that the odds ratio estimates can be adjusted for the baseline viral load and the 
cPSS score of the background regimen. In one dataset, using the CCOs set out on page 
36 herein, the odds ratio for response per additional active drug added was 3.01 when 
calculated using clinical cutoffe, and 2.32 when calculated using biological cutoffs. 

25 These odds ratios are ratios of odds of response for people taking more active dmgs 
over the odds for people taking less active dmgs. The odds ratio here should be > 1 as 
the probability of response will increase as the number of active dmgs taken increases. 
In this case, a larger odds ratio indicates a stronger correlation wi1h clinical outcome. 



